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ABSTRACT 

This booklet summarizes a discussion of the 
opportunities provided by computerized information-handling 
technology to Improve student achievement in reading and writing. The 
first section discusses the developient and edutfatlonal use of an 
automated dictionary (AD) that would allow a student to designate a 
i»ord by typing it and receive information about it, either aurally by 
electronically generated speech or visually on a cathode ray tube 
screen. Also discussed- ar* ihe possibilities of incorporating an AD 
In a word processor and the larger contexts in which these systems 
would fq-nctton. The second section outlines the technical 
implemetitatlon of computer hardware and software in the classroom. 
Tile thlpd section describes the kinds of scientific research In 
lexlco^aphy, linguistics, psychology, and education that will be 
required to realize the full potential of such syatems. The original 
proposal made to the National Institute of Education suggesting the 
feaslbllitry of ADs is appended. (AEA) 



♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦ V********* ***************************** 

♦ Beproductions supplied by 'EDRS are the beat that can be made ♦ 

♦ from ^h« original document, ♦ 
♦♦♦♦♦♦♦♦♦♦♦♦{►♦♦♦♦♦♦^♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦♦***************** 



AUTOMATED 
DICTIONARIES, 
READING AND 
WRITING ' 



Chairman's Report 
of a 
Conference on 
Educational Uses of 
Word Processors with Dictionaries 

June 14-15, 1979 

4 

George A. Miller 
Chairman 



lOUCATlON A WILPAKI 
NATIONAL iNtTiTUTf OP 
■ DUCATION 

THIS DOCUMENT MAS ftE E N REPRO- 

OUCfO fXACTlV AS RECEIVED FRQM 
THE Pf^RSON OR ORGANilAT ION ORIGIN- 
ATING IT POINTS OF VIE W Oil OPINIONS 
STATfO DO NOT NECESSARILY REPRE- 
SENT OFFICIAL NATIONAL INSTITUTE OF 
EDUCATION POSi^iON OR POl iCV 



U.S. Department of Healthy Education, and Welfare 
Patricia Roberts Harris, Secretary 
Mary F. Berry, Assistant. Secretary for Education 



National institute of Education 
Michael Timpane, Acting Director 

December 1979 



AUG irm 



The New Information-Handling Technology* 



Wo have cnlorothi rovolutioiiufy ago in Ihc technology ol inloriuimoii handhng 
The micrt>elocta)iiic rovoUnioii. extensively iliKinuentctl in tht S«Rteml>er 1«)77 
issue otScientU'ic American, is making ,>ossible a roughly tenfold dcj^^'ase each 
five years in the cost of the integrated circuits that are at the heart of conteni|>orary 
digital computers There are available for individual use. at relatively low prices. 
«)phisticated calculators and computers and means ol generating images on video 
tubes: 

Handheld calculators providing the four arithmetic functions and square riH,t 
now sell for ab*)Ut $10. and the cost of calculators providing trigon.)metric and 
logarithmic functions has declined to the same level. C'aleillators able to handle 
4(K) step programs are now at the $10() level 

Handheld instructional devices providing drill and games in aritlunetic ($15 
25) and in spelling, with a simulated human voice ($65). are selling well 
Personal computers which include a TV tube display for letters, mimbcrs. and 
graphics; keyb«)ard. processor, a si/cable memory, and weigh less than 50 
Lunds are now available for as little as $600. Accessories include wonJ- 
processing devices allowing Rexible construction and editing of written text alTd 
production of typed copy. These computers arc comparabje m capacity to 
computers costing hundreds of thousands of dolla* a decade ago. and arc only 
the first of many that will become available at decreasing prices or with much 
greater capability at the same price. Incorporation of microcomputers irtto V 
sets could make substamial computer power available in the home to each child. 
A videodisc system coming onto the market for $700 provides Ihc follov^ng. in 
conjunction with a regular TV set: 

• 54 000 separate frames of full color picture image, or alphanumeric or 
computer information, on one side of a disc similar in size and cost to an LP 
reco'rd The 54.000 frames correspond to 30 minutes of video at 30 frames 
per second, any. part of which can be played at regular or reduced speed. 

• Random access (dialed in more expensive mod<;ls) to each of the frames, 
which are individually numbered and can be viewed individually for any 
length of time desired and may contain one quarter of a page of easily legible 
text (or a full page with special high, re.solution video tubes). 

Intelligent videodisc systems under development Include a computer which 
allows controlled sequencing of frames, which may be based on student 
, responses as well as the possibility of transfering-computer programs from the 
disrto the computer. Video discs can store 10 billion bits of information 
(Encyclopedia Britannica contains 2»billion bits, a human chromosome has a 
capacity of about 20 billion bits, and the human brain perhaps 10.000 billion). 
This immense storage capacity could probably include on a single disc all the 
computer courseware ever published. 



♦Reprinted" ftx)H? Testing, Teaching and learning. Chairmen's Report of a 
Conference on Research on Testing, National Institute of Education, October, 
1979. 




FOREWORD 




The National Institute of Education ( NIE), the primary federal agency 
for educational research and development* sponsored the conference 
reported on here with the aim of exploring opportunities provided by the 
new igformation-hafdling technology (see box) to improve student 
achievement in reading and writing. 

Factors contributing to this opportunity include the sharply declining 
cost of information-handling technology, including word processors; 
improved knowledge and techni'cal capacity for handling strings of 
charpcters and words with computer logic, for comparing them with a 
lexicon of the English language and drawing inferences on their syntax and 
meaning; and modem inQL>rmation-processing models of human per- 
formance and learning. 

The specinc stimulus for this conference was a paper prepared by 
George A. Miller at an NIE-sponsored conference on Testing, Teaching 
and Learning, August 11-26, 1978. It is reprinted here as an appendix to 
this report. 1 . * 

[ Arthur S, Melmed 

Office of Special Studies 
Nationaf Institute of Education 
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INTRODUCTION 

In* August 1978, George Miller proposed to the National Institute of 
Education(NIE) that automated dictionaries (see Appendix) might serve a 
variety of educational purposes. In March 1979, Arthur Melmed asked 
Reddy to undertake an analysis of the feasibility and cojjt of such a device. 
The preliminary analysis, prepared by M.S. Fox, D. Rebel, and A. Pftrker 
of Carnegie-Mellon University revealed that retrieval systems incorporat 
tng modem computer technology could easily make automated dictionaries 
• widely available. On June 14-15, NIE sponsored a small conference in 
New York at the Rockefeller University to explore further the educational 
value of automated dictionaries. George Miller chaired the meeting. 
Present were: 

Mark Aronoff, Stale University of New York at Stony Hroi^k 

John Parker Damon, McCarthy- Towne School, Acton, Mass. 

Lawrence T. Erase, Bell Laboratories 

John M. Mays, National Institute of Education 

Arthur Melmed, National Institute of Education 

George A. Miller, The Rockefeller University 

Seymour Papcrt, Massachusetts Institute of Technology 

Rnj RedSy, Carnegie-Mellon University 

Sylvia Scribner, Center for Applied Linguistics 

' The group discussed a wide range of topics. Subsequently, Miller 
prepared a document summarizing the sense of the meeting. That 
preliminary version was circulated to the participants, whosoiadditions and 
corrections are reflected in the present report. 
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GOALS 



Advances in computer technology have reduced the cost of powerful 
microprocessors until small businesses, schools, and even private indi 
viduals can now afford to purchase them. As one facet of its resjx^nsibilily 
to evaluate educational implications of new technology^ NIE is interested 
in possible applications of these new systems to education in general^ One 
aspect of this general interest is the possible application of such systems in 
the teach^ig of communicative skills based on what Miller had called 
automated dictionaries. 

Automated Dictionary 

The sihipiest'system that .would qualify as an automated dictionary 
(AD) would allow a user to designate a word (eg,, by typing it) and receive 
information about it (e.g., its dictionary entry), either auditorily (e.g., by 
electronically generated speech) or visually (e.g., on a cathode ray tube). 
Although the original purpose of the conference was to advise NIE 
concerning practical goals for the development and educatiohal use of AD 
systems, the group also discussed larger contexts in which such systems 
might function. 

fVord Processor with Dictionary 

Seymour Papert persuaded the group that, if the ultimate aim is to 
facilit|ite the acquisition of communicative skills (particularly reading and 
writing), the most useful device to give, students is a worcl proc^tor (see 
box). For an adult writer, revising and editing a draft text is ah importapt part 
of the process of writing Children seldom Tkave that experience. Their first 
draft is almost always their final text; i|i a pcncil-papcr-erascr technology, the 
sheer meehanical messiness of it makes revision difficult A word 
processor could remove an obstacle to one of the central skills of writing. 
Editing is simple on a good word processor and the product is legible, 
attractive, and easily stored, scored, or duplicated. Moreover, inasmuch as 
lexical knowledge is a component of writing and reading, the acquisition of 
Ie\ipal knowledge should not be drtifically separated from the mastery of- 
the mote general skills. , 

The focus of discussion broadened, therefore, to 'include the possibility 
of incorporating an AD in a word processor. Two uses of word processors 
with dictionaries (Wl^/D) were cohsidered. 
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Commercial WP/D Systems 

Rjy Reddy suggested that, given a choice between some existing word 
processor and the same system plus lexical information, most peopfe would 
prefer the WP/D system. If his evaluation is correct, manufacturers of 
woal prwessors could probably be persuaded to make their prixlucts more 
attractive by the addition of a lexical retrieval system, and some of the 
development costs of an educationally useful device would be borne by 
commercial users. 

Moreover, once the lexical feature became available on business 
machines, less expensive versions would probably be offered to hobbyists 
for home computers. Conference participants familiar with the history of 
computer technology were confident that, if WP/D is commercially 
successful, it will eventually become available to educators if they arc 
simply willing ih wait for it. 

/ 

Educational WP/D Systems 

It is unlikely that a WP/D system designed for offices would be optimal 
for classrooms. The size of vocabulary and the detail of lexical entries 
might be very different in an educational system. Moreover, an educational 
version should be light, sturdy, and attractive to children, its operation 
should not disrupt other classroom activities, and it should offer a variety of 
language games that would be inappropriate in most ofTices. Some 
participants in the conference argued that the appropriate classroom 
system is a general purpose cornputer; word processing (with or without 
lexical resources) is but one of many educational uses of these versatile 
machines. Discussion at the conference, however, was confined to AD and 
WP/D systems. 

An appropriate goal for NIE would be to establish a continumg 
■ program of evaluation of educational AD and WP/D systems in 
appropViate social and educaUortal contexts. In order to effect such a 
program it will be necessary to encourage the design, construction, and 
programming of AD and WP/D systems, to support research intended to 
establish pedagogical principles to be incorporated in them, and to support 
stadies of their use in representative educational environments. Active 
coordination by NIE will be required, iince three independent disciplines 
will have to cooperate: computer science, lexicographjj, and psychology. 
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In order to^ssess the educational value of AD or WlVD systems it is 
necessary to have one or more of them to experiment with. Such systems 
can be implemented with existing hardware if we are willing to make 
whatever compromises are necessary in the data base and the programming - 
in order to get them opei^ting*. It is recommended that such instantiations of 
the concept be undertaken as soon as possible, but with full realization that 
improved models will evolve as experience accumulates and as further 
research can support technical improvements in both the hardware and 
software. 



Hardware 

Although WP/D systems open up different educational applications 
from an AD alone, the two systems pose similar technical problems— in 
both, the central problemMs how lexical information is to be stored and 
retrieved. The major difference between them seems to be that a handheld 
AD is feasible <at the present time, whereas a similarly small and self- 
contained WP/D probably is not. Since the problems are so similaf, the 
following discussion of hardware is limited ta the itfiplementation of an 
experimental AD system. / 

One way to realize an AD would employ a videodisc. A videodisc has 
54,000 frames; each frame could contain one or several dictionary entries 
(i.g., lists of homographs); a videodisc playback under computer control 
could display any frailte in less than 2 sec. Videodisc systems are currently 
available for experimental purposes— all that is required in order to create 
an experimental System is'for someone to prepare at\d photograph enough 
entries to make t^ie videodisc. f 

Videodiscs might also be usefUl for graphic displays related to 
dictionary entries: for the kinds of illustrations u^ in dictionaries, or for 
maps, portraits, pictures of famous p^ces, or artwork relevant to some 
particular cirriculu|n. Retrieval could be trigwred by the same signal that 
triggers retrieval of written information. Illustrations co^ld be displayed on 
i color TV spatially adjacent to the AD or WP/D display. This 
arrangement would allow experimental comparisons of AD or WP/D 
systems with and without correlated graphic displays. 

Greater flexibility in tl^e size of dictionary is possible with Read-Only- 
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Memory (ROM), Randonv Access Memory (RAM), an external device 
(standard or floppy disk), or, eventually, electro magnetic bubble memory 
Fox! Bebel, and Parker analyzed three categories of AH ^handheld, 
stand-alone, terminals) and estimated the present costs to build one 
prototype model of these syslems (between $2,tK)() and $36XKK), 
depending on size of dictionary, category of device, tyi>c of memory, 
number of terminals). 

Those estimates do not include the costs of software, economies of 
scale, or anticipated reductions in the cost of memory. Fox, BebeK and 
Parker estimated that, with volume prixluction, a'^ClOO word handheld AD 
would cost $2,000 at present and a 10,000- word AD would cost $5 ,000, In 
1988 the costs (in 1979 dollars) would be $200 and $300, respectively: in 
1988 a 30,000- word handheld AD would cost $700. These estimates are 
probably high. Cost reductions are jx^ssible because'the electronic com 
ponents (CPU and ROM circuits) arp expected to decrease in price 
much more rapidly than the nonelectronic hardware (keyboard, power 
supply, display), and because large memory is a dominant aspect of AD 
systems. These economies also suggest that an AD could be added to a 
commercial word processor without increasing its cost sig'^fllcantly. 

Even at these lower prices, however, handheld AD systems will not 
complete with printed dictionaries: an unabridged dictionary ofters half a 
million entries for less than $100, Smaller dictionaries are useful to young 
children, however, and other benefits than lexical information should be 
offered by AD systems. The challenge, therefore, is to process lexical 
information in an AD system in active, useful ways that will be attractive to 
students and teachers. 

With appropriate software, any of the lexical games suggested by 
Miller could be incorporated. The games under-utilize the CPU capabili- 
ties, which suggests that more complex functions could be performed, e.g., 
text editing, sentence parsing. 

Input and output (I/O) devices should be designed to facilitate easy, 
natural interaction between child and machine. We expect that progress in 
achieving interaction will require considerable* imagination and trial-and- 
error. As a starting point, however, light pens, or overlays sensitive to the 
pressure of a finger, might be used instead of keyboards. Acoustic I/O 
should also be considered: although electronic speech recognition is not yet 
reliable enough for general use, it will be; electronically produced speech is 
currently intelligible and can be substituted for written phonetic symbols. 
Speech processing requires memory, of course, and so reduces correspon- 
dingly the storage available for other lexical information. Until circuits are 
perfected and extremely large memories become extremely cheap, 
therefore, speech I/O is probably impractical. However, the possibility 
that a walkie-talkie (handheld radio transponder) may eventually become 
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the preferred I/O device for an AD system should not be lightly dismissed. 

Although a number of interesting hajjdware problems are appaieiU. 
hardware is not a limiting factor in the creation of AD or WP/D systems. 
Cost will limit the market tcitipi^rarily. but by the time satisfactory 
solutions have been found to many software problems, appropriately large 
memories should be available at very reasonable costs. 



Software 

Programs of instructions to c<?Htrol computer hardware are called 
software. We shall use the tem( broadly, however, to include also the 
various assumptions that underlie the choice of processes to be implemented. 

Lexicographie Aspects ^ 

DictionariJ^are published in a range of sizes, where size (and price) 
depends on the number and length of lexical entries. Customers might 
prefer all the information in an unabridged dictionary, but they usually 
settle for a cheaper, more convenient college or pocket edition. Instead of 
Funk and Wagnalls New Standard or Merriam-Websters Third New 
International (about 450,000 entries in 3000 pages, where an entry may 
include many different senses and subsenses), Ihost people buy -the 
American Heritage Dictionary or the Merriam- Webster s New Collegiate 
Dictionary (about 150,000 entries), or an even smaller pocket dictionary 
4\about 50,000). Still smaller dictionaries are published for beginning 
students. In an alphabetical listing, of course, the more entries there are the 
longer it takes to find the entry you want; then you must search through the 
alternative senses for the one most appropriate in the given context.. 

Physical size and the amount of information available are also 
correlated for handheld AD systems, although th^ correlation is reciuged 
somewhat by the CPU and display, which need not change in size as the 
number of memory circuits changes. Since the density of ROM circuits is 
expected to double every 18 mc^. over the next decade, the correlation 
between physical size and information available should be attenuated even 
further in the future. The correlation between retrieval time and 
information available will also hold in automated systems., but retrieval 
shottkl be fast enough that even the longest delays would not be 
bothersome. In AD or WP/D systems where several terminals timeshare a 
CRW and lexical memory is centralized, the physical size of a user s device 
is already independent of the amount of information available, and retrieval 
delays depend on the timesharing system, not on the number of lexical 
entries, ^ ^ 
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Willi {he (ladilional si/c inlormation ticlay coiiclations sus|XMulcd, wc 
necil to ictonsidei how many lexical ctitries are desired or desirable ' I lie 
more llic beller' is a iiaiural answer I he words jxiople wanl lo UxA up are 
usually Ihe rare, unfamiliar ones thai can be found onl^ ui large 
diclionaries. It is a lexicographic fact, however, that entries in large 
dictionaries are more sophisticated than are entries in small dictionaries, 
both because the entries in small dictionaries are shorter and because 
simpler words are used in the dcrinitions; it is a criterion of le^ographjc 
excellence not to use a sense of any word in a definition unless that sense is 
itself defined in that dictionary. If we were to opt for an unabridged 
dictionary, therefore, it would not be intelligible to many children. Bui if we 
were to opt for a small dictionary with simple definitions, it would be of 
little use to many adults. The educational level of the user is an important 
parameter in automated systems, just as it always h^s been in printed 
<jHctioiiarics. 

Oiven unlimited memory, the simple solution would bt to include 
both - adding 50.000 redundant but short entries io a data base that 
already has 450.000 sophisticated entries is a relatively nuxlest increment. 
In a system with two dictionaries, a command to DEFINE a word might 
first produgiJ./i siinple version, if it existed, a commaW to EXPAND the 
definition could lead to the sophisticated version. ILkpcrimental assess- 
ment jtSuld provide data as to wj^h users requested ^xpanded definitions 
for >*4iich words. 

Vn an educational WP/D system, it would be instructive to provide 
RAM circuits adequate to hold a few thousand entries that students could 
use to wrfte their own dictionaries. (% should be expected that students 
would discd^r a variety of other uses for such a retrieval system.) this 
facility might also be useful in commercial WP/D systetns to accomodate 
the specialized vocabularies of different occupational and professional 
groups. 

Lexical entries characteristically give spelling, part of speech, pronun- 
ciation, some etymology, a listing of alternative senses with usage notes for 
some. Entries may also include illustrations, or contrast a group of near 
synonyms. Extensive abbreviation is often used, and much thought has 
been given to typographic conventions. The question arises whether all this 
information should be retained, or whether entries should be recast in a 
form more convenient for computer processing, or more suitable to 
particular educational purposes. 

If the goal is to evaluate prototype educational WP/D systems, 
arguments to begin with the traditional form of dictionary entry are 
reasonably persuasive. Whereas typing a dictionary is aj^ expensive task, 
machine- readable dictionaries already exist (Sherman. 1979). An elab- 
• orate characlter generator for a CRT display is not particularly expensive. 
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and until we have had some experience with these systems we have have 
little Idea what educational purposes we might hope to achieve by 
modifying the way entries are written. A novel type of dictionary entry may 
emerge from the development of these system^, but that should not head 
our list of problems initially. On the other hand, the use of existing 
dictionaries raises problems of copyright; can pennission be obtained to 
use copyrighted materials for exper!m|gntal purposes? 

'Existing dictionartes4ist many homographs. * Palm/ for exiimple^ takes 
three entries; one for the tree, one for the part of the hand, and one for the 
verb. Moreover, each entry has several senses, and some senses have 
subsenses. If a user merely indicated an interest in 'palm/ the device would 
have to display all three entries. The more entries that must be displayed, 
tlie larger tjie display screen should be, and the longer it takes to locate the 
particular sense ofsubsense appropriate in the given context. Little is kno>f 
about] the ability of small children to make such selections accurately. 
^It would be economical, both of display size and of time, if the .AD 
could perform some of those selections for the user, In order to do so, 
however, the device must Have additional inpUt information. For example, 
if a student were reading a story in which 'palm' occurred on line 17, he 
might type PALM 1 7 and see displayed the particular sense that the author 
or a te acher had selected in advance as appropriate in the context of line 1 7. 
Alternatively, if the device is used to display a story that the student wants 
to reai, the machine could scan the context of *palm' for other worlds that 
could be used to determine which sense seemed most appropriate. The 
feasib ility and value of such refmements is a proper question for empirical 
invest gation. 

F( r writing with a WP/D system, some kind of thesaurus organization 
woula probably be valuable (Sedelow, 1976). Roget's Internationml 
TTiesaurus, for example, provides a conceptual hierarchy six. levels deep. 
At the sixth level are groups of words sharing a given sense or concept. The 
internal structure of the thesaurus can be represented abstractly as a 
.matrix, wfiere rows represent words and columns represent lexical 
concepts (Bryan, 1973); cells of the matrix are I if the given word can 
express that concept, and 0 otherwise. Thus, a row gives the^concepts a 
word can express and a column gives the different words that express the 
same concept ( that occur together at the sixth level of the thesaurus). From 
such matrices it is possible to construct paths between semantically related 
groups of words. Given a thesaurus in this form, an author in search of a 
better word might ask, not for definitions, but for near synonym^lof a word 
4 already thought of; those could be displayed in a menu format; touching one 
of them would produce a similar display of its near synonyms, and so on 
until an apposite wond was found. It should be possible to key a thesaurus to 
a dictionary, so that if an unfamiliar word appeared in the course of a 
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thcsaiii4ls search, its appropriate sense Irom the dictionary could be made 
avnilable. ^ 

Synonymy is not the only interesting relation between words, I he value 
of incorporating other relations e.g., antonyms, superi>rdinate and 
subordinate words, free associations, rhymes, word laHiuencies should 

* be explored empirically. Precisely hlw this kind of information should be 

collected, how it should be represented in a lexical device, and wha^^ 
comm{{nds should be available for using it are all matters for further study. 
The first syst(^ms to be tested will adopt the simplest and most convenie?K 
form, but the results lihoujd establish a bascluie against which any 
advantages of future additions or refinements can \}e measured. 

Although dictionaries and thesauruses were the primary focus of 
conference, the data base conception need not be lip^Hcd tiTwords, On line 
data bases of d^tes, geographical indexes, and companions to literature, 
poetry, and the theatre, provide a means of exploring deeper questions in 
,various subject matter areas. The dellnition of "revolution, ^ for insta^ices, 
comes alive and remains fixed in memory when the student can easily go on 
to explore the French, American and Russian Yevolutions, These expanded 
mental contexts support further learnings. Studies of the conditions and 

I strategies that impel student explorations of complex knowledge will 

provide important data on intellectual development and on the effective 
structural characteristics of different subject matter areas. 

It should be recognized, however, that the ii^lusion ol all these kinds of 
intlbrmation in a unified database will pose a substantial development 
problem, since millions of bytes of data will be involved, and since it must 
be retrievable both alphabetically and taxononiically (i.e., via related 
concepts). The data base management system will hava to be carefully 
designed in advance, for modifications and injprovements will be very 
difficult to make in the middle of th^ task. 

Linguistic Aspects 

At the conference it was noted by several participants that adding a 
^ ,syntactic parser to a word processing system might also have educational 

applidations. In particular, Lawrence Frase spoke of text processing 
systems" able to operate intelligently on phrases, sentences, or even longer 
chunks of text. Thus, the concept of a text processor with dictionary 
(TP/D) was invented, but its instantiation and evaluation was not set as a 
present goal for NIE. Rather, a TP/D system might be thought of as the 
distant goal toward which work can be expected to move in the future. 

Psychological Aspects 

J 

In his proposal for an educational AD system. Miller (Appendix) 
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assumed thai, in addition to displaying dctlnitions. a variety o( games and 
pii//les should be provided to make the use ot the system enjoyable to children 

• Spellmg tests, where words are written at dictation (eithei dictation 
by a teacher, oi by the system itself) and scoied by the system 
lexas Instruments is already nuirketing a version oi this called 
Speak & Spell. 

• 0 A string of lelkMs is displayed and the student judges whether they ■ 
form a' word, ^le word recognition task has been widely used bv.^ 
psychologists to explore how lexical knowledge' is organized in a 
IKMSon s memory (Miller, h)78a) 

• Anagrams, where a string of letters is displayed (peihaps m the 
form o|; a familiar word) and the student makes as many words as 
|H>.ssible using those lettds. CMiildren s metlhKls of solving ana 
grams have been found to provide a useful index of their linguistic 
and cognitive development ( Schniederman, Reber. and Mainline. 

• A definition is displayed and the student tries to guess what the 
word is. 'I his procedure can induce the tantalizing tip of the tongue 
experience (Brown and McNeill, 1966). 

• A., picture is displayed and the student names it. The latency and 
eilors are known to be inversely rekited to (he familiarity of the 
object (Oldflcid and Wingfleld, l%5). 

• Two words are displayed and the student tries to Tind the shortest 
path between them through the thesaurus (Quillian, 1968)/ 

These are merely examples. A wide variety of word games have been 
compiTed by WrHiam mid Mwy Morris, aiwl many oi them could be 
adapted to AD or WP/t) systems. In many cases, accuracy and latency 
information can be (unobtrusively) collected and compared with, experi- 
mental results published in the psycholinguislic literature. 

Such games and puzzles have as their goal to make a child interested in 
words." A focus on individual words, however, can interfere with the 
thought processes involved in reading or writing. Conference participants 
speculated thai the reason they do not use a dictionary every time they are 
uncertain of a word^s meaning is not that it takes time to find the diet j,ohary 
and turn to the right page; the real reason is that using a dictionary 
presupposes a preoccupation with words, rather than with the message that 
the words were intended to convey. It^is only when a text becomes 
indecipherable that they turn to a diclionary for help. If this introspective 
appraisal is correct, automating dictionary lookup will not automatically 
increase dictionary use during reading or writing. The possibility remains, 
however, that lexical information would be more useful while study ing than 
while reading for pleasure, and more useful while editing than while 
composing a first draft. In any case, it would be Instructive to collect all the 
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words thai any particular child had called lor Such lists could be 
iiicorjH)ratcd into gtjiiies or puzzles lor that child, or they could signal a^ 
need for teacher intervention. 

Several developmental psycholinguists have undertaken research on 
the- growth of vwabulary in children. Most of this work has involved 
presch(H)l children, since the learning piiKess gLes so rapidly that it is 
dinicult to keep track of what 'older children knmv, I'arly research 
concentrated on vwabulary size and rate of growth: during the past decade 
more interest has l>een expressed in the underlying structure of lexical 
knowledge, i.e., the relations between words that lellect a child s 
conceptual knowledge. Carey (1978) describes the learning process as 
having two phases: ( 1 ) a rapid phase in which a new word is identified and 
assigned to some general semantic domain, followed by (2) a slower 
working out of the specific distinctions among words assignee^ to the same 
domain. Almost nothing definite is known ciinceriiing conditions that 
would facilitate learning of either ty|K\ although it is plausible to sup|H)se 
that they might be diflerent. At the present time, therefore, the main 
contribution to the {^resent project that might be expected from this branch 
of psychology is a number of research workers interested in pursuing the 
topic. 

Educational Aspects *' 

Obviously, any system that is intrtxluced into the schi>ols should be 
matched to the interests and abilities of the children who will use it. 
Developmental psychologists have devoted some attention to age cate- 
gories for toys, and manufacturers of electronic games that are played m\ 
the family television set have also given some thought to thiji question. But 
most of the experience we have in making computers accessible to children 
has been based on mathematics (jsee, e.g., Papert. 1979). Papcrt has foUnd 
that powerful computers combined with programming languages that are 
well conceived and easy to use can smooth a child's entrance into a 
computer culture. In some of his exploratory studies, a WP was integrated 
with a mathematical component in several ways; the two.can be mutually 
supportive in the acquisition of skills needed for interaction with a 
computer. The results indicate that children can use a WP, but further 
studies are clearly needed. 

In conversations with ArJ)iur Melmed, Dr. Allen Newell of Carnegie- 
Mellon U niversity offere^ the opinion that the most critical factor affecting 
tKe success of an AD system for children would be the ease and naturalness 
of th« child-machine interaction. The younger the child, the truer this 
observation is likely to be, but even with 10-year olds, who can compose 
interesting essays and type them on a keyboard, the ease and naturalness of 
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uueraclion will probably \m: of greater iinporiaiKe than inanv ot the lexical 
or linguislic aspects of Ihe system s operation 

A well designed system must not only be attractive lo childien. it nuisl 
also appeal to teachers and fit the educational system Machmcs ni the 
schooluH^n have a ptu^r record, and nUroducing a new one even one 
designed to be hUraclive t%o children may not be easy l-ollowmg the 
conlerence, Parker Damon provided the ivHowing list ot rea.sons why 
schools have been slow to accept any technology more, complicated than 
the biH)k, the chalkboard, and the ditto machine: 

1 Many new devices are promoted before the educational need Ihey 
are mccling ha^ been established in the imnd ol the teacher. 

2 Many of them are too eomplieated and loo expensive to use on a 
wide scale. (Remember that as a rulcfnduslry spends 9 times more 
on the equipment inside a plant than on the building housing the 

1^ . equipment. Schools do the reverse. Salaries and fixed operating 

' costs consume WX> of a schix^l's budjiel. All other supplies, 
consumable and reusable, contend for the remainder. ) As a result of 
this topsy turvy allocation of rcstuirces, schcv^ls arc unfamiliar with 
the task of helping teachc/s acquire the interest, knowledge, and 
ability nced(^ to use- new equipipcnt and procedures successfully. 

3. Some new devices appear at first glmice to be similar to something 
that was tried before without success, a f rill to make the teacher's 
job easier, or a move away froiu the essentials of sound education. 

4. A few devices arc not used because of technical dirtlcultics. The 
typewriter is hn example, Typewriters are noisy, heavy, +)ulky, 
breakable, difTicult to maintain. ' 

Damon sees no reason why an educational WP/D system triust fall prey 
to these problems, but both The Bysterta and the mftnn^r of introducing il will _ 
have to be carefully designed to av6id a deluge of dilTicultics, 

Along with/being mechanically and technically ca&y to use by teachers 
and students, and being recognized as n^ce^sary or helpful to the learning 
process, the WP/Dmustalso be recognized and responded to for what it is, 
before it is introduced. The WP/D will upset the status quo of existing 
school practice. The roles and relationships ol^achers and students will be 
^changed. School districts which initially can afford to adopt the WP/D will 
be set apart from others that do not, 6r cannot, do so and thereby subject 
themselves to social and political pressures that should be anticipated. 

There are at least two schools of thought regarding how best to 
introduce computer technology into schools. One point of view advocates 
thorough inserwice preparation for the faculty before the equipment arrives. 
The topics for such an inservice program might include: philosophical and 
pedagogical implications; comprehensive training in the use of the 
equipment; extensive exploration of ways to apply the equipment's use to 
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Ihc curricuKiin; exaniinahon ol Iho everyday problems lhal may he 
encountered. Tho other school ol (hought advocates (hat the inservico 
program accompany the inlroduclion of (he equipment into the schiH>l 
Both views agree lhal Ihe schiH)rs commilmeiu lo inservice shmiki he both 
extensive and enduring. 

Problems ol ' design nuiy be easier lo solve than problems ol 
implementation. According to Damon» il the lirst generation i>f*\VlVI) has 
the folK)wing characteristics^ it and the ones to lollow may find quicker 
acceptance by the schoi)ls 

1. Inexpensive; $100 or less per unit that ^Mie studcMU uses. 

2. Easy to operate by a teacher; preferably easy to operate by a third 
grader. 

3. Reliable: durable, relatively n\ain\Miance free, repairable, 

4. Portable: easy to move abiHit and sit^M enough to fit on a student's 
desk. ^^^"^ 

5. Quipt: docs not present the classnnmi with either a visual or aural 
distraction. 

6. Magical and useful: does something that caiuiot now be donV* easily 
and slTould be done frequently. 

Il is not likely that the first models that will be rigged up to explore the 
usefulness of these systems will be optimal tljat would be like asking the 
Wright brothers to begin by inventing the C\)ncorde. Nevertheless, 
important advances have recently been made in easing indeeVl erasing - 
the barrier between users and computers (see, e.g.. Bok, 1979), and the 
pt^ssibility of exploiting some of these new techniques should not be 
overlooked. 

Whereas ih^ goal of a commercial WP/D would be to produce better 
writing, the goal of an educational WP/D would be to* V>rixluce better 
writers. The difference can be illustrated by the question of how the system 
should respond to mistakes. For example, if a user misspells a word, should 
the system ignore it« signal that a mistake has occurred, refuse to continue 
until it is corrected* or correct it unobtrusively? If spelling were corrected 
automatically, proper names, foreign expressions, and unusual words 
would show up asiiiisspellings. In an educadotial system, teachers would 
presumably want ih& system to give whatever response c6uld be shown to 
tedch children the most about spelling while not inhibiting writiftg. 

Since the^numbeir of lexical entries that can be stor(5d in a handheld AD 
system Will be limiled, al lea^ initially, educators may wish to dc^velop 
special word-books for particular academic subjects (e.g., for teaching 
foreign languages), and to abandon the goal of casting a general dictionary 
in that form. A single device, perhaps with interchangeable cassettes, could 
serve a variety of school subject, and might be maVket^d as aut()mated 
glossaries acci||npanying particular textbooks. Even small vocabularies in 
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a handheld AD might be iisctui to bilingual stiidei>ls l>aiticipaius at the 
conference disclaimed any superior foresight concerning the best use of 
such devices; their advice was to make something available and hope dial ii 
would inspire users lo explore its {Kiienlial applications and lo suggest 
needed improvements. 

Some educators may resist this kind ol teaching device on philosophical 
grounds. Language is so uniquely human, so important socially and . 
fiersonally. that to learii it from a machine may seem objectionable, if not 
abnormal. On the other hand, children are going to be exfwscd to machines 
of this type after they leave schat>l. the transition should be easier if they arc 
already familiar with them.We suggest that any children involved in the 
initialiests should be at least ten years old and free from any indications of 
autism- their social and conversational, skills should already be well 
established in a human context -and that any teachers involved should be 
screened for a positive attitude teachers who objected to su^h syste^iis 
could unconsciously communicate that a^itude to their studc^Jj^ 

Because we have so little experience with language-based coiii^tei s in 
the classrwm, the development of AD devices would benefit from early 
and continuing information on user system interactions. Such information, 
at the outset, might be largely descriptive. Classrooms would be selected in 
Which teachers were favorable to the project and students were cultiirally 
and scholastically diversified. Observations, recorded by eye and hand, as 
well as videotape, could document spontaneous activities with the device 
generated by individual children or groups of children; the conditions under 
which they generate these activities; the'nature of children's participation 
in teacher-irittiated and self-initiated encounters, and their reactions to ^ 
these learning episode!?. Sijtiilarly, teacher uses of the device could be 
recorded with special,,att^ion paid to the development of new (unan- 
ticipated) uses and to the integration of computer games and lessons into 
the currictihim. InittaHy, the reseafcb «im would ht to produQe naturaX^, 
histories of AD try-outs that could serve two purposes: help designer? ' 
make more informed choices from among alternative hardware and 
software features: begin to identify characteristics of classroom situations, 
students and teachers which seem to affect utilization. 

As imprt)ved devices become available and knowledge accumulates 
about how these work in the classroom, NIE might plan a larger-scale, 
systematic study of experiences, with one or more systems. These 
• descriptive studies would, .in turh, lay the basis for experimental 
^ interVerttions and evaluative reseailbh designed to test the impact of the 
devices in various classrooms and/their learning poteptiarfor individual 
children. The model of educationar research we have in mind is that of the 
"agricujfural station" which participates in the development of new 
programs and techniques as well as in their experimental test. 
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^ Although a scientific base adequate for an initial implementation of AD 
or WP/D systems already exists, further research will be required m order 
to realize the full potential of sucff systems. The following discussion 
covers some of the more obvious lines of research that seem relevant, but it 
is far from exhaustive. . 

Lexicographic Research 

' The systems considered here can be viewed as special applications of 
the results of a new field of lingifistic research that some have called 
computational lexicography. A typical project in this field might ca|l forthe 
automatic disambiguation of English text, i.e., for the identification of the 
intended sense of each word in cortnect^d text. For example, given the 
sentence, 'The pig is in the pen," the system should automatically assign to 
'pen' the meaning enclosure, not writing implement. Amsler and White 
(1979) would track this relation through lexical entries, e.g., one sense of 
•pen' is 'enclosure for animals;' one of the senses of pig is 'young swine,', 
and one sense of swine' is mammal,' and mammal' is ^animal;' thus an 
intobection is found and the appropriate sense of pen' in this sentence is 
identified. In order to conduct this search. Amsler and White i system 
deals with individual senses, an(J for a given dictil^nary (the nouns m^the 
Merriam-Webster Pocket Dictionary in woric completed to date) cpn- 
structs a tangled forest of the hierarchical trees indicating which senses 
occur in the definkions of whIcH other 8eii84s.<P»tr insult resemWes^ * 
thesaurus. Sedelow (1 976) b«s proposed a similar program.' except that she 
has uken Roget's Thtsaurus as (he data base. 

iFa letter to Arthur Melmed, Robert F. Simmons (with W^ P. 
Lehinann, a principal invesUgator in the Dictionary Project reported by 
Amsler and White) comment«!d that "The Uxonomic data base will immedi- 
ately support • system that can teach the hierarchiJW structure of English 
^Uh each definition. "I suspecl that many variations of this teaching game 
can be discovered with some thought, and it will surely generaUze to the 
teaching of foreign languages." Collaborative research between these 
lexicographers and cognitive psychologists could quickly establish edu- 
^cational uses for this hierarchical data base.' 
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Linguistic Research 

Id recent years generative^ iransfonnational linguists have heciMiic 
increasingly interested in the role of the lexicon, l or cxampfT^^ an 
interesting convergence of syntactic and lexical theory appears in the 
recent work ol j^an Biesnan x\\ MM Biesnan ( 1978) has proposed that a 
"lunclionar* structure l>e included in the lexical entry lor each verb, and 
that this runctional structure provide a direct mapping iVoni ilie logical 
argument structure of a verb into its vaiiou.s syntayic contexts. Ihe 
functional .structure tor the verb hit.* for example, wouki be something like 
NP, HIT NPm which maps the logical structure v HH r. into such 
syntactic contexts as The arrow hit <he target* or Tlie target was Int.* A 
parsing system capable of identifying noun phrases and prcjxxsilional 
phrases, and recognizing which pluases serve lis logical arguments of 
which verbs, Jjhould go a long way toward extracting the underlying logical 
argument structure of a sentence. 

I'or longer stretches of text, .some Imguisis (e.g.. Longacre. 1968: 
Grimes, 1972) have proposed lechniqlies of discourse analysis. As 
Sedelow |K)iniS out. almost all of the categories that liave been proposed f^ 
di.scourse analysis depend on an understanding of word meanings. It might 
be possible to automate discourse analysis, but pnly in conjuncion with 
some version o\ an automated dictionary. 

\ 

Psychological Research 

It might be of some practical value for psychologists to establish age 
norms for performance on the various word games and puzzles that will be 
used. Work on estimation of vocabulary size, more or less abandoned for 
twenty years, might be revived, with educational goafs explicitly defined. 

Sqchl^search is of less intellectual interest, however, than the attcrrrpti^ 
to characterize the general structure of pqoplcs' long-tcmi memory for 
words, and especially the studies of how such structures growyn children. 
For example, is a named category of objects learned by learning a IfSt'of 
criterial and characteristic properties, or is it learned by identifying a 
prototypical ^nstance and then judging other instances on the basis of their 
sin>ilarityH*j4jjy)j;ototype? To what exteflt does learning the meaning of a 
word depend on acquiring a repertoire af appropriate motor responses to it? 
Are the alternative sert.ses of a'word conceptually distinct, like a list, or is 
there some abstract gore nreaning that is particularized dilTerently \n - 
diderent contexts of use? How do we recognize metaphorical uses of a 
word? And so on and on. These and many other similar questions are 
currently being .studies by cognitive psychologists. Although the answers 
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coulit;Hclp to shaiK a ihcory of what chiMicn shoulil be learning from their 
inlcractions wilh AD oi WP/D syslcrus. the design ol such systems ni the 
near luture will have topiocWd without the benefit ol such a tlKV)iy It is to 
be hoped that these new systems, and the extensive data they will enable us 
to collect, will contribute lo the development of better cognitive theoi ies ol 
semantic knowledge and ultimately to bottei^^dueation 

A dilTcrent line of psychological research might view these systems us 
merely*onc more instance of the growing list of machines that our society 
expects children lo interact wilh. How are children .interacting wilh the 
electronic and mechanical devices we already have? What are the 
psychological consequences of inequitable access to devices considered to 
be desirable? Do, children \vho have access lo logically complex devices 
solve problems dilTerently from children vyho do not have such access * 
Members of the conference had thou own convictii>ns about these matters, 
but they were sup|X)iled by anecdotes, not by scientific evidence 



Educational Research 

The proposed evaluation is. of course, an example of educational 
jwwsearch.. Of special importance to educators will be information on 
whether teachers of reading and writing are willing to rely on such systems: 
on the attitides of parents and taxpayers toward such developments: and on 
the (possibly inequitable) elTects of home systems for the eilucalion of 
children from wealthier families. 

Little concern has been expressed in this report for the social, moral, or 
cihieai implications of *t)iis educational methodology, kul thqse are 
questions of centrarconcern to many educators. It may be desirable or 
necessary to impanel an (idvisory group chartered to review those aspects 
of the work before any substantial federal funding is proposed. 



Comment tin^esearch 

Improved hardware for these lexical systems is not the first priority. 
Existing technology is already adequate for an initial evaluation of AD and 
WP/D systems: NIE can play an important role by simply coordinating 
resources already available and activities already under way. The principal 
need is the development and testing of new software, broadly defined. If 
large federal expenditures are needed, it is to be hof>ed that, in addition to 
NIE. the Advanced Research Projects Agency of the Department of 
Defense or the National Science Foundation might also become interested. 

It is possible that a project of this kind could have worthwhile 
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consequences for research.^even if new funding is not forthcoming. 
Recognizing what information is needed in order to construct and improve 
such practical systems can sometimes fwus and clarify resea^J^- goals. 
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RECOMMENDATIONS 

F 



The following recommendations to the National Institute jrf|iducalion 
.w out of the discussion summarized in the preceding pages: 



1 . The NIE should coordinate a continuing program of exploration of the 
educational use of AD and WP/D systems in appropriate swial and 
educational contexts. 

2. In order to promote the development of educational system^ worth 
Exploring, the NIE should seek private or federal funding for; 

a. The design and construction of handheld AD with as Jarge a 
vocabulary as possible at the present time; special attention should 
be paid to the attractiveness of the device to young children. 

b. The design and construction of an experimental WP/D system for 
test with older children, with several terminals and a large 
vocabulary stored in the central processor's RAM circuits; 
facilities should be provided for experimenting with the f6rm and 
organization of lexical information. 

3. In oixler to coordinate the above activities, to foimulate various 
subgoals more specifically, and to draw other interested persons 'into the 
planning of this work, the NIE sfiould sponsor additional meetings and 
conferences as they deem necessary and appropriate.. 
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APPENDIX 



Automated Dictionaries* 
by George A. Miller 

A dictionary is a treasury of information about a language. It contains 
information about spelling, pronunciation, etymology, syntactic category, 
meanings, accepted usages, even pictures, for thousands and thousands of 
different words. It is valuable for both writers and readers. It is also 
valuable to teachers and students, because it is a codified representation of 
a llirge body of linguistic knowledge that literate adults are expected to 
have. 

Nevertheless, most adults would probably agree that they do not use 
dictionaries as often as they should. One reason is that referring to a 
dictionary interrupts the flow of thought. It takes time, first to find the 
dictionary, then to find the word, thpa-4o select the desired information 
from the lexical entry. Few adults— and even fewer children— have the 
sustained attention span required to bridge such interruptions. 

If there were sufTicient value in the project, the automation of a 
dictionary would be well within the power of existing technology. It would 
be possible, for example, to type a word into a computer keyboard and have 
the machine pronounce it. Any information stored with that word could be 
^displayed on a CRT almost instantaneously. The tedious task of searching 
through an alphabetical listing is one that machines can perform much 
faster than humans. A central computer could easily serve a large number 
" of classrooms. Let us refer to such a system as Automated Dictionary 
Moder i, or AD-1 for Short. 

Let us imagine that AD-1 is available and consider whether Any 
interesting research projects might result. The most obvious question to ask 
is whether anyone would use it, or wheUier it would be as Neglected as its 
printed progenitor. User studies might suggest ways to make the system 
easier to refer to, or more accessible to mor* people. It is likely, however, 
that Uie system would be ignored unless special, efforts were made to 
educate people in its value, or special tasks were devised to use the system 

in integral ways. , 

' A second line of research, therefore, would l?e to develop interesting 
and educati^Jkal tasks in which AD-l played a useful role. Some examples: 

•Reprinted from Testing, Teaching and Learning, Report qf a Conference on 
Ruearvh on Testing. National Institute of feducation, October, 1979. 
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• Spelling. A word Is pronounced and the student types it into the 
keyboard. If the spelling is correct, AD-1 confirms it; if not, various 
responses might occur, ranging from non recognition to corrected spelling 
suggestions (based on tabulations of the most frequent sjpellifl| mistakes). 

• Word recognition. Letter sequences could be presented and the 
student asked whether or not they spell an English word. Al> I could serve 
as referee in student competitions. 

• Anagrams, the student could be challenged to form as many words 
as possible from a given collection of letters, and given access to AD- 1 to 
settle dubious instances. (Scrabble, word pyramids, etc.) 

• Word search. The dictionary definition of some sense of a word 
could be given and the student challenged to think of the word. Guesses 
would be typed into AD-1, which would display the definitions of that 
word, until a matching definition was found. The definitions mig^it be 
.provided in the form of clues for the solution of crossword puzzles. 

• Picture naming. A picture from the dictionary might be displayed 
and the student challenged (as in word search) to think of its name. 

These and other dictionary games could be explored for their intrinsic 
interest to children and for their educational value. From a computerized 
record of a student's use of AI>-1 it should be possible to assess ability in 
spelling, vocabulary size, or knowledge of multiple meanings and, if 
desired, reduce those assessments to comparisons with other students or 
with external criteria. 

A third line of research would concentrate on the development of more 
sophisticated models. Two alternatives suggest themselves a priori. One 
line would explore programs for using the dictionary data base in a variety 
of different ways of presenting* tasks and l^eeping records. A second line 
would explore additions to the data base of various kinds. 

- As an example of the first kind, it might prove desirable to be able to key 
the dictionary to particular texts that children were reading. If a new word 
appeared in the text^^e teacher might wish to display only that sense of the 
word that was appropriate in the context, thus savings children the difficult 
(ask of selecting the appropriate sense. Speoial software would be required 
to enable the teacher to ihake such modifications fn the standard display. 
Or it might be desirable to edit the dictionary i|i other ways— simplifying 
definition, inserting more pictures, giving more sentences illustrating 
usage, deleting etymology, or whatever— which would also require ^pecial 
software. Or the various tasks (exemplified above) might be presented by 
the system, rathcfr than the teacher, which would also complicate the 
system. - ' 

Such revisions, leading to what we might call A]>2, should be 
und^rtajcen in close collaboration between systems programmers and 
educational research workers familiar with the potential uses of the system. 
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It is not impossible to imagine elaborate and ex|>cnsive soltwaic 
innovations that would have no real va^^^r to teachers or students 

Versions that might be called AD- 3 sWmld consider additions to the 
data base itself. Perhaps the most obvious Example would be to add a 
thesaurus to the dictionary, thus providing cross references between 
lexical entries tlyit are valuable in searching for a|»posite words. Alphal^etic 
versions of Roget s thesaurus are probably w^iiliiate to supjKift initial 
explorations in this direction at the present timej' 

A request for information tropi a thesaurus elicts a set of words. 
Computers can do interestin^jJtiings with sets that would be quite laborious 
to do by hand. They can.TSr example, find the intersection of two .sets. 
Thus, it should be possible to type two words into AD 3 and have displayed 
only those words that are listed under both entries thus sharply narrowing 
the range of .search. 

Various teaching-testing tasks might be set on AD 3. l orexample. two 
words might be selected with nonintcrsectiong sets of as.si-)ciates. and the 
student challenged to find the shortest path between them. A trace-of the 
student s exploration should provide an interesting picture of his or her 
sensitivity to alternative meanings of words. Once the path was found, the 
student might be asked to return to the dictionary and select the particular 
senses of each word that are involved in the path. (It might also give the 
student an intuitive feeling for the theory of finite sets.) 

Dictionaries do not itemize hyponyms. That is to say. under both 'table' 
and 'chair' they will give the superordinate term, 'article of furniture.' but if 
you rook up 'furniture,' you will not find 'table* or 'chair.' (The exception is 
crossword dictionaries, which specialize in hyponyms.) This information 
could also bemadded to an automated dictionary (at least, for nouns). In 
s6me cases— animal and plant taxonomies, for example— this lexical 
information is part of the subject matter of some courses. 

However, the most valuable additions to an automated dictionary- 
grammar aod general knowledge—woutd al^o be the most difficult to make 
in the presJht state of the art. There is something artifical about any use of 
wowis mJfcplation. In order to deal with words as they occur in normal 
communicSion, the system must be able to recognize sentences and to 
assign interpretations to them that are reasonable, given the context and 
our presupposed genera! knowledge about the world and the events that 
occur in it. AD-4, incorporating grammar, and AD-5, incorporating 
grammar and general knowledge, are still beyond the reach of current 
science and technology. Enough progress "has been made in Iheie 
directions, however, that we can at least speculate about such systems and 
their educational Implications. 

It would be possible to add that component of a grammar known as the 
' "phrase structure rules" dii^ctly to the entires in an automated dictionary. 
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simply by aikling to each entry the grammatical contexts m which that word 
can occur. Such a representation would be enormously redundant (the 
niU^&ibr furniingnoun piira&es^ior exaxnple. would b<! repeated in the ent^y 
for every noun), and there arc reasons to believe it could not be complete 
(the transfonnational component of the grammar cannot be caputred 
lexically). The only scheme that is considered plausible at the present time 
is to add gramtmatigal rules as a component separate from the lexical com 
ponen^ in which case the system should probably be referred to as an 
automated grammar, rather than an automated dictionary. 

Relevant evidence might be obtained, however, by a careful analysis of 
children's errors. If those errors can be characterized in terms of syntactic 
classes of words, rather than in terms of individual words, it would provide 
strong evidence for an independent grammatical component and against 
the information of even phrase structure rules in the lexical component. 
Evidence available at the present time suggests that children learn 
^ammatical rules abstractly, not redundantly for each particular word, 
because they often pvergeneralize a new rule. 

An automated grammar (AG-1) is feasible at the present time, given 
the various parsing systems that have been developed by computational 
linguists. These systems have frequently been used in question answering 
systems: the question is parsed and compiled as a program for retrieving 
information from a prestored data base, then answered with a simpliHed 
declarative syntax. None of tbcse systems i^ capable presently of dealing 
with the full range of grammatical English sentences, and most can process 
only a small vocabulary of terms directly related to the data base and the 
questions that can be asked of it. 

AG-1 could support two lines of research, one leading to more 
sophisticated systems, the other exploring its educational applications. 
Along the first line, it would be useful to see v^hether a dictionary could 
provide an interesting data base for a question answering system. Along the 
second line, one would attempt to devise educational tasks using AG-1: 
• Parsing. The student tries to parse a simple sentence (one known to 
be correctly analyzed by AG-I ) by typing its labelled bracketing. If the 
student's parsing is correct, it is accepted; if not, various responses might 
occur— rejection, acceptance of only the correct parts, suggestions as lo 
what is wrong, etc. \ 

^ Syntactic ambiguity. Some sentences can be parsed in more than 
one way. Students might compete to find all correct parses, with AG-1 
serving as referee. 

^ Sentence anagrams. The student could be challenged to form as 
many sentences as possible from a given set of words, ai^d AG-1 used to 
setting whether any disputed ordering was grammatical. 
^ • Privileges of occurrence. T\ie student could be given a sentence with 
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one word missing and asked what words could 1111 the blank: A(i I would 
accept or reject, relative to that sentence, any word that the student ty|>ed. 
The twk should teach a distinction between sentences that are syntactic 
ally Hl-fomied and sentences that are grammatical, but semantically 
anomalous. 

• Editing. AG-1 should be able to do most ol the relatively 
mechanical corrections that teachers make in student compositions. I he 
advantage, in addition to freeing teacher time for instruction that AG I 
could not provide, would be immediate feedback to the student while his or 
her communicative intentions are still operative. 

• Question answering. Questions about words could be phrased in 
standard English, so that the various tasks developed for the automated 
dictionary woul^ not have to be posed in some special code. 

Future developments of AG- 1 might lead to a linguistic equivalent of 
the arithmetic BUGGY system, but we are presently far from understand 
lng9hildren's mistakes well enough to simulate their alternative grammars 
in a computer. 

We are even further from AD-5. a system that could interpret sentences 
in the light of general knowledge about the world. As a metaphor, it is easy 
to speak of adding an encycifftpedia to the dictionary in the system s data 
base, in which case the system should probably be called an Automated 
Encyclopedia (AE-I). Not until the representation of large bodies of 
information in a computer memory is better understood, however, will it be 
feasible to implement such a system. Preliminary reseailh might begin, 
however, with particular bodies of infoipiation relevant to specific 
academic subjects. Question answering systems with a data base on. say. 
the American Revolution, might be experimentally feasible at the present 

time. r . u J 

A full scale Automated Encyclor«dia could be a powerful teacher, and 
might be tpken as a distant goal to work toward. Meanwhile, systems althe 
level of AD-1 to AD-3 could not only support interesting educational 
research, but could provide valuable experience to guide the development 
of more sophisticated systems. 

* U.S.G.r.O. 625-308/1302-1762 . 
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